The Discrete Infinite Logistic Normal Distribution

Authors

  • John Paisley
  • Chong Wang
  • David M. Blei
Abstract

We present the discrete infinite logistic normal distribution (DILN), a Bayesian nonparametric prior for mixed membership models. DILN generalizes the hierarchical Dirichlet process (HDP) to model correlation structure between the weights of the atoms at the group level. We derive a representation of DILN as a normalized collection of gamma-distributed random variables and study its statistical properties. We derive a variational inference algorithm for approximate posterior inference. We apply DILN to topic modeling of documents and study its empirical performance on four corpora, comparing it with the HDP and the correlated topic model (CTM). To compute with large-scale data, we develop a stochastic variational inference algorithm for DILN and compare it with similar algorithms for the HDP and latent Dirichlet allocation (LDA) on a collection of 350,000 articles from Nature.
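
The "normalized collection of gamma-distributed random variables" mentioned in the abstract can be illustrated with a small truncated simulation. This is only a sketch under stated assumptions, not the authors' exact construction: the truncation level, the hyperparameter names `alpha` and `beta`, the linear kernel built from random atom locations, and the function name `sample_group_weights` are all hypothetical choices made for illustration. It relies on the standard fact that normalizing independent gamma variables with a common scale gives a Dirichlet vector; rescaling each gamma by the exponential of a correlated Gaussian then induces correlation between the group-level weights.

```python
import numpy as np

def sample_group_weights(num_atoms=20, alpha=5.0, beta=5.0, num_groups=3, seed=0):
    """Illustrative sketch: correlated group-level weights from normalized gammas."""
    rng = np.random.default_rng(seed)

    # Top-level weights via truncated stick-breaking (assumed truncation level).
    v = rng.beta(1.0, alpha, size=num_atoms)
    v[-1] = 1.0  # close the truncation so the weights sum to one
    p = v * np.concatenate(([1.0], np.cumprod(1.0 - v[:-1])))

    # Hypothetical covariance between atoms: linear kernel on random 2-D locations,
    # with a small jitter on the diagonal for numerical stability.
    locs = rng.normal(size=(num_atoms, 2))
    cov = locs @ locs.T + 1e-6 * np.eye(num_atoms)

    # One correlated Gaussian vector per group.
    w = rng.multivariate_normal(np.zeros(num_atoms), cov, size=num_groups)

    # Gamma variables with shape beta * p_k, rescaled by exp(w_k), then normalized.
    z = rng.gamma(shape=beta * p, scale=1.0, size=(num_groups, num_atoms)) * np.exp(w)
    return z / z.sum(axis=1, keepdims=True)

weights = sample_group_weights()
```

Setting every entry of `w` to zero in this sketch would reduce each row to a Dirichlet draw with parameters `beta * p`, which is the finite-truncation analogue of the HDP case that DILN generalizes.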

Related articles

Discussion of "The Discrete Infinite Logistic Normal Distribution for Mixed-Membership Modeling"

Mixed-membership models (e.g. “topic models”) are inarguably popular; especially latent Dirichlet allocation (LDA) [Blei et al., 2003] and its variants. Such models have become a fundamental tool in the analysis and exploration of many types of data. Originally designed to model text documents as per-word draws from a document-specific weighting of a finite collection of “topics” (distributions...

The Discrete Infinite Logistic Normal Distribution for Mixed-Membership Modeling

We present the discrete infinite logistic normal distribution (DILN, “Dylan”), a Bayesian nonparametric prior for mixed membership models. DILN is a generalization of the hierarchical Dirichlet process (HDP) that models correlation structure between the weights of the atoms at the group level. We derive a representation of DILN as a normalized collection of gamma-distributed random variables, a...

Logistic–Normal Distribution: Properties and Application

In this paper, some properties of the logistic-X family are discussed, and a member of the family, the logistic–normal distribution, is studied in detail. Average deviations, the risk function, and the mode of the logistic–normal distribution are obtained. The method of maximum likelihood estimation is proposed for estimating the parameters of the logistic–normal distribution and a data set is ...

The Analysis of Bayesian Probit Regression of Binary and Polychotomous Response Data

The goal of this study is to introduce a statistical method for analyzing latent data in the regression analysis of discrete data, and to relate the probit regression model (for the discrete response) to the normal linear regression model (for the continuous latent data). This method provides precise inferences on binary and multinomia...

A Discrete Kumaraswamy Marshall-Olkin Exponential Distribution

Finding new families of distributions has become a popular tool in statistical research. In this article, we introduce a new flexible four-parameter discrete model based on the Marshall-Olkin approach, namely, the discrete Kumaraswamy Marshall-Olkin exponential distribution. The proposed distribution can be viewed as another generalization of the geometric distribution and enfolds some importan...

Publication date: 2012